Skip to content

Conversation

@dhruvachak
Copy link

This envar acts as an override blocksize to be used for Xteam reduction kernels. The default is 0 (unused).

This envar acts as an override blocksize to be used for Xteam reduction
kernels. The default is 0 (unused).
@dhruvachak dhruvachak requested review from Kewen12 and ronlieb November 4, 2025 02:06
@z1-cciauto
Copy link
Collaborator

Copy link

@Kewen12 Kewen12 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good!

Copy link
Collaborator

@ronlieb ronlieb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

env var BLOCKSIZE=256 on mi2xx shows the improvement 86 -> 63 seconds

and ADJ=1 96 seconds ADJ=0 66 seconds

PR is a keeper, after it lands, i will add it to the table

@ronlieb ronlieb merged commit a7ba6de into amd-staging Nov 4, 2025
6 checks passed
@ronlieb ronlieb deleted the amd/dev/dhruvachak/xteam_blocksize_envar branch November 4, 2025 16:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants